Abstract. The problem of drift estimation for a diffusion process is usually considered under the hypothesis of
ergodicity. It is less often considered under the hypothesis of null-recurrence, simply because there are fewer limit
theorems and the existing ones do not apply to the whole null-recurrent class.

The aim of this paper is to provide some limit theorems for additive functionals and martingales of a general 
(ergodic or null) recurrent diffusion which would allow us to have a somewhat unified approach to the problem of 
non-parametric kernel drift estimation in the one-dimensional recurrent case. As a particular example we obtain the 
rate of convergence of the Nadaraya-Watson estimator in the case of a locally Hölder-continuous drift.

Résumé. Habituellement le problème de l'estimation du drift pour un processus de diffusion est considéré sous
l'hypothèse de l'ergodicité. Il l'est moins souvent sous l'hypothèse de nulle-récurrence, car dans ce cas il y a moins
de théorèmes limites, et ceux qui existent ne s'appliquent pas à toute la classe nulle-récurrente.

Le but de cet article est de démontrer quelques théorèmes limites pour les fonctionnelles additives et martingales
dépendantes d'une diffusion récurrente générale (ergodique ou nulle). Ces théorèmes permettent de donner une
approche unifiée au problème de l'estimation non-paramétrique par noyau du drift dans le cas unidimensionnel
récurrent. Comme exemple on obtient la vitesse de convergence de l'estimateur de Nadaraya-Watson dans le cas
d'un drift localement höldérien.

MSC: 60G17; 60F10; 92B20; 68T10 

Keywords: Harris recurrence; Diffusion processes; Limit theorems; Additive functionals; Non-parametric estimation; 
1. Introduction

Consider a stochastic differential equation

$$ dX_t = \sigma(X_t)\,dW_t + b(X_t)\,dt, \tag{1} $$

where $b, \sigma : \mathbb{R} \to \mathbb{R}$ and $(W_t)_{t \ge 0}$ is a Brownian motion on $\mathbb{R}$. Throughout this paper we suppose that $\sigma$ is
strictly positive and continuous, $b$ is measurable, and for some constant $C > 0$ and all $x \in \mathbb{R}$

$$ \sigma^2(x) \le C(1 + x^2) \quad\text{and}\quad |b(x)| \le C(1 + |x|). $$

Under these conditions, for each $x_0 \in \mathbb{R}$, Eq. (1) has a unique weak solution $X = (X_t)_{t \ge 0}$ starting from
$x_0$, and the corresponding semigroup is strong Feller (see e.g. [20], p. 170). Furthermore, we suppose that $X$
is Harris recurrent. Recall that this means that $X$ admits an invariant measure $\mu$ on $\mathcal{B}(\mathbb{R})$ such that for any
measurable $f \ge 0$,

$$ \mu(f) > 0 \implies \forall x \in \mathbb{R}: \int_0^\infty f(X_t)\,dt = \infty \quad P_x\text{-a.s.} $$



In our case (more precisely, if $\sigma$ and $b$ are locally bounded Borel functions and $\sigma$ does not vanish), the invariant
measure $\mu$ is absolutely continuous with respect to the Lebesgue measure, see [19], p. 298. The density of
$\mu(dx)$ is given by

$$ \mu(dx) = \frac{1}{\sigma^2(x)} \exp\Big( \int_0^x \frac{2b}{\sigma^2}(v)\,dv \Big)\,dx. \tag{2} $$

Recall also that a one-dimensional diffusion given by (1) is recurrent if and only if

$$ \int_0^x \exp\Big( -\int_0^y \frac{2b}{\sigma^2}(v)\,dv \Big)\,dy \to \pm\infty \quad\text{as } x \to \pm\infty. \tag{3} $$

With regard to the $\mu$-total mass of $\mathbb{R}$ one subdivides Harris recurrent diffusions into two sub-classes: if $\mu(\mathbb{R})$
is finite then $X$ is said to be ergodic, and null-recurrent otherwise.
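
The classification above can be explored numerically. The following is only a rough sketch, assuming the unnormalised density $\frac{1}{\sigma^2(x)}\exp\big(\int_0^x \frac{2b}{\sigma^2}(v)\,dv\big)$ of the invariant measure and a plain trapezoidal rule; the helper names (`integrate`, `invariant_density`, `truncated_mass`) and the two example diffusions are illustrative choices, not objects from the paper. A truncated mass that stabilises as the radius grows suggests the ergodic case, while unbounded growth suggests null-recurrence.

```python
import math

def integrate(f, a, b, n=2000):
    """Composite trapezoidal rule for f on [a, b] (signed when b < a)."""
    h = (b - a) / n
    total = 0.5 * (f(a) + f(b))
    for i in range(1, n):
        total += f(a + i * h)
    return total * h

def log_density_exponent(b, sigma, x):
    """Approximates int_0^x 2 b(v) / sigma(v)^2 dv, the exponent of the density."""
    return integrate(lambda v: 2.0 * b(v) / sigma(v) ** 2, 0.0, x, n=200)

def invariant_density(b, sigma, x):
    """Unnormalised invariant density exp(exponent) / sigma(x)^2."""
    return math.exp(log_density_exponent(b, sigma, x)) / sigma(x) ** 2

def truncated_mass(b, sigma, radius):
    """mu([-radius, radius]); unbounded growth in radius hints at null-recurrence."""
    return integrate(lambda x: invariant_density(b, sigma, x), -radius, radius, n=400)

# Hypothetical examples: Ornstein-Uhlenbeck drift and driftless Brownian motion.
ou_drift = lambda x: -x
zero_drift = lambda x: 0.0
unit_sigma = lambda x: 1.0

ou_mass = truncated_mass(ou_drift, unit_sigma, 6.0)    # stabilises: ergodic case
bm_mass = truncated_mass(zero_drift, unit_sigma, 10.0)  # grows like 2 * radius
```

For the Ornstein-Uhlenbeck example the density is $e^{-x^2}$, so the truncated mass settles near $\sqrt{\pi}$; for Brownian motion the invariant measure is Lebesgue measure and the mass keeps growing with the radius.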

For such a diffusion we will consider the problem of drift estimation. Throughout this paper we suppose
that a sample path of the diffusion is observed on $[0, T]$ and that $T \to \infty$.

Since we study the asymptotic behavior of diffusion processes, it is important to have at one's disposal limit
theorems for additive functionals of the form $A_t = \int_0^t f(X_s)\,ds$ or martingales of the form $M_t = \int_0^t f(X_s)\,dW_s$.
The limit theorems for additive functionals and martingales of ergodic diffusions are available on each level
of convergence and are well known; for instance, if $f \ge 0$, $\mu(f) < \infty$, we have a LLN

$$ \lim_{t\to\infty} \frac{1}{t} \int_0^t f(X_s)\,ds = \mu(f) \quad P_x\text{-a.s. } \forall x, $$

and if $\mu(f^2) < \infty$, a CLT:

$$ \frac{1}{\sqrt{t}} \int_0^t f(X_s)\,dW_s \xrightarrow{\ \mathcal{L}\ } \mathcal{N}(0, \mu(f^2)). $$



For null-recurrent diffusions there is no LLN with deterministic normalization, and weak convergence
theorems are available only for a subclass of regularly varying null-recurrent diffusions, see [12, 21] and the
book by Höpfner and Löcherbach [10]. As for limit theorems working in a general recurrent setting, there is
only one result: the famous Chacon-Ornstein theorem stating that all integrable additive functionals (IAF)
of $X$ are asymptotically equivalent: $\forall f \ge 0$, $\mu(f) < \infty$; $\forall g \ge 0$, $0 < \mu(g) < \infty$;

$$ \lim_{t\to\infty} \frac{\int_0^t f(X_s)\,ds}{\int_0^t g(X_s)\,ds} = \frac{\mu(f)}{\mu(g)} \quad P_x\text{-a.s. } \forall x, $$

and its integral version yields that $\mu$-a.s.

$$ \lim_{t\to\infty} \frac{E_x \int_0^t f(X_s)\,ds}{E_x \int_0^t g(X_s)\,ds} = \frac{\mu(f)}{\mu(g)}, \tag{4} $$

where the exceptional set depends on $f$ and on $g$.

The literature on statistical inference for ergodic diffusions in general and drift estimation in particular 
is extensive and significant results can be obtained in this case, see for example the works by Dalalyan and 
Kutoyants [5], Dalalyan [4], Van-Zanten [22], Galtchouk and Pergamentchikov [8], Yoshida [24] to mention 
just a few; an extensive survey can be found in the recent book [14] by Kutoyants. 

The situation is different for null-recurrent diffusions. As we have mentioned, an important class of
regularly varying null-recurrent diffusions behaving much like the ergodic ones has been thoroughly studied
by Höpfner and Löcherbach in [10]. For such diffusions the pair $(M_t/\sqrt{v_t}, A_t/v_t)$ converges in law as $t \to \infty$,
where $v_t = t^{\alpha} l(t)$ for some $0 < \alpha < 1$ and $l(t)$ varying slowly at infinity. Using these facts, Höpfner and
Kutoyants [9] have presented one of the first examples concerning the rate of convergence and the limit
distribution of the MLE for null-recurrent diffusions. It might be possible to extend their method to the whole
class of regular variation, but since it is based on weak convergence it cannot be used in general cases.

Our aim in this paper is to develop some tools, for the problem of drift estimation, and more precisely, 
for the rate of convergence calculation, which would work in a general recurrent (null or ergodic) setting. 

The first idea in this direction could be derived from the Chacon-Ornstein theorem. Because all IAF 
of X are equivalent, we can consider some fixed IAF as a "time" of the system and try to work with a 
random normalisation based on this IAF. This idea was used in [7] to study the rate of convergence of a 
non-parametric kernel estimator and by Loukianova, Loukianov [17] to obtain the a.s. rate of convergence 
of the MLE. However, applying this idea in each particular case presents a real technical difficulty, and no 
concise method, based on this idea, has emerged so far. 

The second idea is based on the following observation: we are interested in the calculation of the rate of 
convergence, and the rate is a notion of tightness. All known limit theorems deal with a stronger type of 
convergence, and none (except Chacon-Ornstein) works for the whole recurrent class. However, tightness 
should be sufficient to treat the problem of the rate of convergence. Therefore the natural question arises: 
is there a property of tightness with deterministic norming for IAF, which would be true for all recurrent 
diffusions? The answer is affirmative: 

For every recurrent diffusion $X$ there is some deterministic function $v_t$, called in the sequel the deterministic
equivalent of $X$. With this function, which will be explained shortly, the following theorem holds:

Theorem 1. For every IAF A t of X the following holds 



Such a property was first proved for recurrent Markov chains by Chen [3], without any relation to statistics.
In [16], using Chen's method, we extended this property to one-dimensional diffusions, and recently, in [15],
using a new version of the Nummelin splitting method, this property was extended to the whole class of
continuous time Harris recurrent Markov processes.

To explain what a deterministic equivalent $v_t$ is, we need the notion of a special function.

Definition 1.1. A measurable bounded function $f : \mathbb{R} \to \mathbb{R}_+$ is said to be special if for every $h : \mathbb{R} \to \mathbb{R}_+$,
measurable, bounded and such that $\mu(h) > 0$, the following function $U_h f$ is bounded:



This notion can be extended to additive functionals. 

Definition 1.2. A continuous AF $A_t$ is said to be special if for every $h : \mathbb{R} \to \mathbb{R}_+$, measurable, bounded and
such that $\mu(h) > 0$, the function
is bounded.

For instance, a diffusion local time is a special additive functional (SAF), see [2]; on the other hand, any
measurable bounded function with compact support is special for an $n$-dimensional strong Feller diffusion,
see [15].

Observe that in the "integral version" (4) of the Chacon-Ornstein theorem we cannot determine whether
the result holds or not for fixed $x$, $f$ and $g$. The most important application of special functions and
functionals lies in the fact that the assertion (4) holds for every $x$, and moreover, the following "strong"
version of the Chacon-Ornstein theorem (SCO) holds:

Theorem 2. If $A$, $B$ are two SAF such that $\|\nu_B\| := E_\mu B_1 > 0$, then for every pair $(\pi_1, \pi_2)$ of probability
measures,

$$ \lim_{t\to\infty} \frac{E_{\pi_1} A_t}{E_{\pi_2} B_t} = \frac{\|\nu_A\|}{\|\nu_B\|}. $$

(Recall that if $A_t = \int_0^t f(X_s)\,ds$ with $f \in L^1(\mu)$, then $\|\nu_A\| = \mu(f)$.)

Now, to normalize our additive functional we take some special $g$ with $\mu(g) = 1$, and put

$$ v_t = E_\pi \int_0^t g(X_s)\,ds, $$

where $\pi$ is some probability measure on $\mathbb{R}$. Clearly, $v_t$ is non-negative and non-decreasing. In view of the
strong Chacon-Ornstein theorem the asymptotic order of $v_t$ depends only on the law of $X$. If $X$ is an ergodic
diffusion, then $v_t \sim t$; if $X$ is a null-recurrent diffusion of regular variation with index $\alpha$, then $v_t \sim t^{\alpha}$; in
particular, if $X$ is a linear Brownian motion, then $v_t \sim \sqrt{t}$. Some less explicit examples are possible: if $X$ is
a drifted Brownian motion with compactly supported drift, then $X$ is null-recurrent and $v_t \sim \sqrt{t}$. However,
to evaluate $v_t$ in the general case we need the semi-group of $X$, so $v_t$ is clearly a more "abstract" object.
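
For linear Brownian motion the deterministic equivalent can be evaluated directly, which gives a concrete feel for the $\sqrt{t}$ growth. The sketch below makes several assumptions that are not in the paper: $g$ is taken to be the indicator of $[-1/2, 1/2]$ (so that $\mu(g) = 1$ when $\mu$ is Lebesgue measure), the initial law $\pi$ is the Dirac mass at $0$, and the integral $v_t = \int_0^t P_0(|B_s| \le 1/2)\,ds$ is approximated by a midpoint rule after the substitution $s = u^2$. The function names are illustrative.

```python
import math

def prob_in_window(s, half_width=0.5):
    """P(|B_s| <= half_width) for a standard Brownian motion started at 0."""
    if s <= 0.0:
        return 1.0
    return math.erf(half_width / math.sqrt(2.0 * s))

def deterministic_equivalent(t, n=20_000):
    """Approximates v_t = E_0 int_0^t g(B_s) ds for g = indicator of [-1/2, 1/2].

    Uses the substitution s = u**2 and a midpoint rule on [0, sqrt(t)].
    """
    upper = math.sqrt(t)
    step = upper / n
    total = 0.0
    for i in range(n):
        u = (i + 0.5) * step
        total += 2.0 * u * prob_in_window(u * u)
    return total * step

t = 1.0e6
ratio = deterministic_equivalent(t) / math.sqrt(t)  # should be close to sqrt(2 / pi)
```

Under these assumptions $v_t / \sqrt{t}$ approaches $\sqrt{2/\pi}$, and quadrupling $t$ roughly doubles $v_t$, in line with $v_t \sim \sqrt{t}$.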

In [16] we have applied the deterministic equivalent to give a rate of convergence of the MLE for a class
of recurrent diffusions with Hölder drift. In this parametric context, the deterministic equivalent turned out
to be an appropriate tool, because if, in the proof of the ergodic case, we replace the normalization $t$ with
$v_t$, we obtain, with very few modifications, a proof for the whole recurrent class.

In this paper we apply the deterministic equivalent to the calculation of the rate of convergence in non-parametric
drift estimation. We suppose, as usual, that the drift function of $X$ is locally $\alpha$-Hölder near some
point $x_0 \in \mathbb{R}$, and we estimate $b(x_0)$ by the Nadaraya-Watson estimator $\hat b^h_{x_0,t}$ given by

$$ \hat b^h_{x_0,t} = \frac{\int_0^t \phi((X_s - x_0)/h)\,dX_s}{\int_0^t \phi((X_s - x_0)/h)\,ds}. $$

Here $\phi$ is a non-negative smooth function with compact support. The process $\hat b^h_{x_0,t}$ is composed with some
bandwidth $h_t$. In the ergodic setting $h_t$ is a deterministic function of $t$ which can be found using a standard
optimization procedure. A similar procedure, when guessing the optimal bandwidth in the general recurrent
case, gives $h_t$ as a function of $v_t$. As we have noticed above, $v_t$ cannot be calculated in general, but we need
an estimator depending only on the observations. So we also express the bandwidth (and the rate) in terms
of any random and observable IAF $V_t$, equivalent (in the sense of Theorem 1) to $v_t$.
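
To make the estimator concrete, here is a minimal simulation sketch. Everything specific in it is an assumption made for illustration: the path is produced by an Euler-Maruyama scheme rather than observed continuously, the stochastic and time integrals are replaced by Riemann-type sums, the kernel is the biweight kernel (one possible $C^1$ choice supported in $[-1, 1]$), and the test diffusion is an Ornstein-Uhlenbeck process with $b(x) = -x$, $\sigma \equiv 1$. The function names are hypothetical.

```python
import math
import random

def biweight(u):
    """A C^1 kernel supported in [-1, 1] that integrates to 1."""
    return (15.0 / 16.0) * (1.0 - u * u) ** 2 if abs(u) <= 1.0 else 0.0

def simulate_path(b, sigma, x0, dt, n_steps, rng):
    """Euler-Maruyama discretisation of dX = sigma(X) dW + b(X) dt."""
    path = [x0]
    x = x0
    sqrt_dt = math.sqrt(dt)
    for _ in range(n_steps):
        x += b(x) * dt + sigma(x) * sqrt_dt * rng.gauss(0.0, 1.0)
        path.append(x)
    return path

def nadaraya_watson_drift(path, dt, x_point, h, kernel=biweight):
    """Discretised ratio of int K((X_s - x)/h) dX_s over int K((X_s - x)/h) ds."""
    num = 0.0
    den = 0.0
    for x_prev, x_next in zip(path, path[1:]):
        w = kernel((x_prev - x_point) / h)
        num += w * (x_next - x_prev)  # increment of the stochastic integral
        den += w * dt                 # increment of the time integral
    if den == 0.0:
        raise ValueError("the path never visits the window around x_point")
    return num / den

# Hypothetical experiment: Ornstein-Uhlenbeck process, true drift b(0.5) = -0.5.
rng = random.Random(0)
path = simulate_path(lambda x: -x, lambda x: 1.0, 0.0, 0.01, 200_000, rng)
estimate = nadaraya_watson_drift(path, 0.01, 0.5, 0.3)
```

The bandwidth `0.3` is fixed by hand here; choosing it as a function of an observable additive functional is exactly the question addressed in the rest of the paper.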

Therefore, the objects appearing in this problem, and those whose asymptotic behaviour we need to study,
are additive functionals $A^h_t$ and martingales $M^h_t$ composed with some deterministic or random
bandwidth $h_t$, namely

$$ A^{h_t}_t = \int_0^t \phi\Big(\frac{X_s - x_0}{h_t}\Big)\,ds, \qquad M^{h_t}_t = \int_0^t \phi\Big(\frac{X_s - x_0}{h_t}\Big)\sigma(X_s)\,dW_s. $$

To be precise, with a view to rate calculation, we want to show some tightness property for these objects,
with some normalization depending on $v_t$. When the bandwidth is random, $A^{H_t}_t$ is no longer an additive
functional, and $M^{H_t}_t$ is no longer a martingale. It would be difficult to obtain their asymptotics directly, using
stochastic calculus methods, because these objects do not have the usual stochastic structure. To avoid this
difficulty, we first show a tightness property for $A^h_t$ that is uniform in $h$. Namely, we show that $\frac{1}{h} A^h_t$ admits
$v_t$ as uniform deterministic equivalent (Theorem 5). We then derive a "diagonal" tightness property for
$A^{H_t}_t$ and $M^{H_t}_t$, Corollaries 3.1 and 3.2. All results regarding the limit behavior of $A^{h_t}_t$ and $M^{h_t}_t$ are given in
Section 3.

The main technical tools of our work are two theorems on the local time $L^x_t$ of $X$, namely, the uniform
in $x$ deterministic equivalent (Theorem 3), and the uniform in $x$ strong Chacon-Ornstein theorem (Theorem 4). This
is the content of Section 2.

Statistical applications are given in Section 4. Theorem 6 gives the rate $r_t = v_t^{\alpha/(2\alpha+1)}$ specified by the
bandwidth $h_t = v_t^{-1/(2\alpha+1)}$. The random rate $R_t$ and the bandwidth $H_t$ are given by the same theorem and
are: $R_t = V_t^{\alpha/(2\alpha+1)}$; $H_t = V_t^{-1/(2\alpha+1)}$.

Expressed in deterministic terms, our result agrees with the well-known rate $r_t = t^{\alpha/(2\alpha+1)}$ in the ergodic
case of this model, as well as with that of Delattre and Hoffmann [6], where the minimax rate $r_t = t^{-\alpha/(4\alpha+2)}$
specified by the bandwidth $h_t = t^{-1/(4\alpha+2)}$ was found for the model of drifted Brownian motion with
compactly supported drift. Recall that this diffusion is null-recurrent with $v_t \sim \sqrt{t}$. Our random expressions
agree with those of Delattre, Hoffmann and Kessler [7], where the Nadaraya-Watson estimator was studied
in a minimax setting by different methods and both rate and bandwidth were expressed in terms of one
fixed IAF: the occupation time.
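
The exponents above come from the usual balance between a bias of order $h^{\alpha}$ and a stochastic term of order $(h v_t)^{-1/2}$: equating the two gives $h^{2\alpha+1} = v_t^{-1}$. The short sketch below only illustrates this arithmetic; the function names are hypothetical, and the heuristic balance is an assumption about the "standard optimization procedure", not a quotation of the proof.

```python
import math

def bandwidth(v, alpha):
    """Bandwidth h = v^(-1/(2 alpha + 1)) balancing h^alpha against 1/sqrt(h v)."""
    return v ** (-1.0 / (2.0 * alpha + 1.0))

def rate(v, alpha):
    """Normalising rate r = v^(alpha/(2 alpha + 1)) associated with that bandwidth."""
    return v ** (alpha / (2.0 * alpha + 1.0))

# Ergodic case: v_t ~ t.  Null-recurrent case with v_t ~ sqrt(t): the exponent halves.
t, alpha = 1.0e4, 1.0
ergodic_rate = rate(t, alpha)                    # t^(alpha/(2 alpha + 1))
null_recurrent_rate = rate(math.sqrt(t), alpha)  # t^(alpha/(4 alpha + 2))
```

With $v_t \sim \sqrt{t}$ the normalisation is $t^{\alpha/(4\alpha+2)}$, that is, an estimation error of order $t^{-\alpha/(4\alpha+2)}$.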

Notice finally that for multi-dimensional diffusions, it is possible, using the Nummelin splitting, to get a 
result similar to Theorem 6, but only with deterministic rate and bandwidth [15]. 

2. Uniform deterministic equivalent and uniform Strong Chacon Ornstein theorem for the 
local time 

Recall that we consider a recurrent scalar diffusion $X$ given by Eq. (1) and subject to the same assumptions.
As usual, $\mu$ denotes the invariant measure of $X$ and $(L^x_t)_{t \ge 0}$ the Tanaka-Meyer local time at the
point $x$. Let $\pi$ be some probability on $\mathbb{R}$ and $v_t = E_\pi \int_0^t g(X_s)\,ds$ be a deterministic equivalent of $X$. We
suppose $g$ measurable, bounded, compactly supported and such that $\mu(g) = 1$. In this section we derive a
uniform deterministic equivalent for $(L^x_t)_{t \ge 0}$ (Theorem 3), and we show that the local time obeys a uniform
strong Chacon-Ornstein theorem (Theorem 4).

Proposition 2.1. For each $y \in \mathbb{R}$, $\|\nu_{L^y}\| = \sigma^2(y)\mu(y)$, where $\mu(y)$ is the density of the invariant measure
$\mu(dy)$ with respect to Lebesgue measure. In particular, $0 < \|\nu_{L^y}\| < \infty$.

Proof. Denote by $p(t; x, y)$ the density of the transition function of $X$ with respect to the invariant measure
$\mu$. This density exists and may be taken to be positive, jointly continuous in all variables and symmetric,
that is $p(t, x, y) = p(t, y, x)$, see [11], p. 149.

On the other hand, the invariant measure $\mu$ has itself a density with respect to Lebesgue measure,
$\mu(dx) = \mu(x)\,dx$, where $\mu(x)$ is the density given in (2).

We have the following relation between the local time and the density $p(t; x, y)$:

$$ E_x L^y_t = \sigma^2(y)\mu(y) \int_0^t p(s; x, y)\,ds, \tag{6} $$

see [1], p. 21 or [11], p. 175. The factor $\sigma^2(y)\mu(y)$ arises since $L^y_t$ is the Tanaka-Meyer local time and not
the Itô-McKean local time. Integrating this relation at $t = 1$ against $\mu$ and using the symmetry of $p$, we obtain

$$ \|\nu_{L^y}\| := E_\mu L^y_1 = \sigma^2(y)\mu(y) \int_{\mathbb{R}} \mu(x)\,dx \int_0^1 p(s; x, y)\,ds = \sigma^2(y)\mu(y) \int_0^1 ds \int_{\mathbb{R}} p(s; y, x)\mu(x)\,dx = \sigma^2(y)\mu(y). $$

□

Remark 2.1. The condition $\|\nu_{L^y}\| > 0$ implies, according to Brancovan [2], that $\forall x \in \mathbb{R}$, $P_x(L^y_\infty = \infty) = 1$,
and that $L^y$ is a SAF.

Theorem 3. Let $K \subset \mathbb{R}$ be some compact set. For any initial probability $\nu$ the following limit holds:

$$ \lim_{M\to\infty} \liminf_{t\to\infty} P_\nu\Big( \frac{1}{M} < \inf_{x \in K} \frac{L^x_t}{v_t} \le \sup_{x \in K} \frac{L^x_t}{v_t} < M \Big) = 1. $$

Before we prove this theorem let us discuss the structure of the proof. The basic idea is of course to show
some kind of uniform continuity property of the family of normalized local times. For each point $x_0 \in K$, we
do it for some small interval $[x_0 - \delta; x_0 + \delta]$ with $\delta > 0$ depending on $x_0$. Using this we will start by proving
Theorem 3 for this small interval and then for the compact $K$. By definition, for $y < z$,

$$ \tfrac{1}{2}(L^y_t - L^z_t) = (X_t - y)^+ - (X_t - z)^+ - [(X_0 - y)^+ - (X_0 - z)^+] - \int_0^t 1_{\{y < X_s \le z\}} \sigma(X_s)\,dW_s - \int_0^t 1_{\{y < X_s \le z\}} b(X_s)\,ds \quad \text{a.s.} \tag{7} $$

For $t > 0$, $y < z$ denote

$$ M^{y,z}_t = \int_0^t 1_{\{y < X_s \le z\}} \sigma(X_s)\,dW_s. \tag{8} $$

Using the occupation formula and the boundedness of $|b|/\sigma^2$ on $K$, with $C > 0$, we can write

$$ |L^y_t - L^z_t| \le 2\delta + |M^{y,z}_t| + C \int_y^z L^x_t\,dx \quad \text{a.s.} \tag{9} $$



Now to prove Theorem 3 we need Lemmas 2.1-2.4. The first one is auxiliary to the second. The second
deals with the martingale term of the decomposition (9), the third one with the last term of (9), and
Lemma 2.4 gives the equicontinuity property we need.

Put $K_\delta := [x_0 - \delta; x_0 + \delta]$.

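Lemma 2.1. For any initial probability $\nu$ the following limit holds:

$$ \lim_{M\to\infty} \limsup_{t\to\infty} P_\nu\Big( \int_{x_0-\delta}^{x_0+\delta} (L^x_t)^2\,dx > M v_t^2 \Big) = 0. $$

Proof. Fix some $x < x_0$. Observe that with some constant $C > 0$ we have

$$ (L^x_t - L^{x_0}_t)^2 \le C\Big\{ (2\delta)^2 + \Big( \int_0^t 1_{\{x < X_s \le x_0\}} \sigma(X_s)\,dW_s \Big)^2 + \Big( \int_0^t 1_{\{X_s \in K_\delta\}} |b(X_s)|\,ds \Big)^2 \Big\}. $$

Observe that, thanks to the conditions on $b$ and $\sigma$ in the Introduction, they are bounded on $K_\delta$. For $t > 0$,
introduce the notation

$$ A_t = \int_0^t 1_{\{X_s \in K_\delta\}} |b(X_s)|\,ds. \tag{10} $$

Since $A_t$ is an IAF (and even a SAF), with notation (10) we have

$$ (L^x_t - L^{x_0}_t)^2\, 1_{\{A_t \le \sqrt[4]{M} v_t\}} \le C\big\{ (2\delta)^2 + (M^{x,x_0}_t)^2 + \sqrt{M}\, v_t^2 \big\}. $$

Notice that

$$ E_\nu (M^{x,x_0}_t)^2 = E_\nu \int_0^t 1_{\{x < X_s \le x_0\}} \sigma^2(X_s)\,ds \le E_\nu \int_0^t 1_{\{X_s \in K_\delta\}} \sigma^2(X_s)\,ds =: \tilde v_t. $$

Thanks to the condition on $\sigma$ the last expectation is an expectation of a SAF, and we have denoted it by $\tilde v_t$
to stress the fact that it is a version of the deterministic equivalent of $X$. Observe that the same reasoning
(replacing $M^{x,x_0}$ by $M^{x_0,x}$) and the same estimates hold for $x > x_0$. Finally, for all $x \in K_\delta$ it holds

$$ E_\nu (L^x_t - L^{x_0}_t)^2\, 1_{\{A_t \le \sqrt[4]{M} v_t\}} \le C\big\{ (2\delta)^2 + \tilde v_t + \sqrt{M}\, v_t^2 \big\}. $$

Now, to prove the lemma we use the following decomposition:

$$ P_\nu\Big( \int_{x_0-\delta}^{x_0+\delta} (L^x_t)^2\,dx > M v_t^2 \Big) \le P_\nu\big( 8\delta (L^{x_0}_t)^2 > M v_t^2 \big) + P_\nu\Big( 4 \int_{x_0-\delta}^{x_0+\delta} (L^x_t - L^{x_0}_t)^2\,dx > M v_t^2;\ A_t \le \sqrt[4]{M} v_t \Big) + P_\nu\big( A_t > \sqrt[4]{M} v_t \big). \tag{11} $$

With some constant $C$, for the second term of the right-hand side of (11) we have:

$$ P_\nu\Big( 4 \int_{x_0-\delta}^{x_0+\delta} (L^x_t - L^{x_0}_t)^2\,dx > M v_t^2;\ A_t \le \sqrt[4]{M} v_t \Big) \le \frac{4}{M v_t^2} \int_{x_0-\delta}^{x_0+\delta} E_\nu (L^x_t - L^{x_0}_t)^2\, 1_{\{A_t \le \sqrt[4]{M} v_t\}}\,dx \le C\, \frac{(2\delta)^2 + \tilde v_t + \sqrt{M}\, v_t^2}{M v_t^2}. $$

According to the strong Chacon-Ornstein theorem there is a limit of $\tilde v_t / v_t$ as $t \to \infty$. Using this, together
with Theorem 1 for the other two terms on the right of (11), we obtain Lemma 2.1. □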

Let $V$ be some IAF of $X$ such that $\|\nu_V\| > 0$; for simplicity we take $\|\nu_V\| = 1$. $V_t$ will play the role
of normalization for $L^y_t - L^z_t$ to obtain the uniform continuity property we need.
In the sequel of this section denote

$$ \Omega_{M,t} := \Big\{ \int_{K_\delta} (L^x_t)^2\,dx \le M^2 v_t^2;\ \frac{v_t}{M} \le V_t \le M v_t \Big\}. $$

Let $\{(M^{y,z}_t; t \ge 0);\ x_0 - \delta \le y < z \le x_0 + \delta\}$ be a family of martingales defined by (8).

Lemma 2.2. For every $u > 0$ and $M > 0$ there is some $\tau = \tau(\omega, M, u, K_\delta)$ such that a.s. for $t > \tau$ it holds



\M t 

y,zGK s n 71 1 
Proof. Fix some $u > 0$ and $M > 0$. Denote

f \ M t' Z \ 
B n = < sup sup — - — ln Mt > 



T 7 *-"M,t — 1/4 

,{t;e n - 1 <v t <e n }{a,beK s } v t v t ' 

We will use the Nishiyama martingale inequality to bound the probability of $B_n$ and then the Borel-Cantelli
lemma to prove Lemma 2.2. The statement of Nishiyama's martingale inequality for our setting is given
below; for a proof of this result see [18] and for its generalization see [23].

Let $\rho$ be a metric on $K_\delta$, and denote by $\|M\|_{\rho,t}$ the quadratic $\rho$-modulus of the family of martingales
$\{M^{y,z}_t;\ y < z,\ y, z \in K_\delta\}$. For every $t > 0$, this modulus is given by

$$ \|M\|_{\rho,t} = \sup_{y \ne z} \frac{\sqrt{\langle M^{y,z} \rangle_t}}{\rho(y,z)}. $$

As usual, denote by $N(K_\delta, \rho, \varepsilon)$ the metric entropy of $K_\delta$ with respect to the metric $\rho$, i.e. the minimal
number of $\rho$-balls of radius $\varepsilon$ needed to cover $K_\delta$. The Nishiyama inequality for tail probabilities [23] states
that



P sup sup \M*''\1{\\m\\, >x)< 2e~ x ^ ™« \ 

K t<T p(y,z)< V J 

where $C > 0$, $\eta > 0$, $K > 0$ are constants, $\tau$ is a finite stopping time and

$$ \varphi(\eta) = \int_0^{\eta} \sqrt{\log N(K_\delta, \rho, \varepsilon)}\,d\varepsilon $$

is supposed to be finite.

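Before we use the Nishiyama inequality we need to evaluate the quadratic $\rho$-modulus on the set $\Omega_{M,t}$.
We have

$$ \langle M^{y,z} \rangle_t = \int_0^t 1_{[y,z]}(X_s)\sigma^2(X_s)\,ds = \int_y^z L^x_t\,dx \le \sqrt{z - y}\, \Big( \int_{x_0-\delta}^{x_0+\delta} (L^x_t)^2\,dx \Big)^{1/2} $$

and then

$$ \langle M^{y,z} \rangle_t\, 1_{\Omega_{M,t}} \le \sqrt{z - y}\, M v_t. $$

If we take $\rho(y,z) = \sqrt[4]{|z - y|}$ as a distance on $K_\delta$, it is easy to see that $\varphi(\delta)$ is finite and we obtain a bound
for the quadratic $\rho$-modulus $\|M\|_{\rho,t}$ restricted to $\Omega_{M,t}$:

$$ \|M\|_{\rho,t}\, 1_{\Omega_{M,t}} = \sup_{y \ne z} \frac{\sqrt{\langle M^{y,z} \rangle_t}}{\rho(y,z)}\, 1_{\Omega_{M,t}} \le \sqrt{M v_t}. $$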
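
The finiteness of the entropy integral for the fourth-root distance can be checked numerically. This is a hedged sketch under explicit assumptions: for $\rho(y,z) = |z-y|^{1/4}$ a $\rho$-ball of radius $\varepsilon$ is an interval of half-length $\varepsilon^4$, so an interval of length $2\delta$ is covered by $\lceil \delta/\varepsilon^4 \rceil$ such balls; the integral is approximated by a midpoint rule, and the function names are illustrative.

```python
import math

def covering_number(delta, eps):
    """Number of rho-balls of radius eps covering an interval of length 2 * delta,
    for rho(y, z) = |z - y| ** 0.25 (each ball is an interval of half-length eps ** 4)."""
    return max(1, math.ceil(delta / eps ** 4))

def entropy_integral(delta, eta, n=100_000):
    """Midpoint-rule approximation of int_0^eta sqrt(log N(K_delta, rho, eps)) d eps."""
    step = eta / n
    total = 0.0
    for i in range(n):
        eps = (i + 0.5) * step
        total += math.sqrt(math.log(covering_number(delta, eps)))
    return total * step

value = entropy_integral(1.0, 1.0)  # finite: the integrand grows like 2 * sqrt(log(1/eps))
```

The integrand behaves like $2\sqrt{\log(1/\varepsilon)}$ near $0$, which is integrable, so the approximation stays bounded as the grid is refined.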



Then 



|Mf 2 | u 
P(B„)=P sup sup J_pi n >_ 

\{t;c"-^<v t <c"}{y,z£K 5 } H v t ' 

UV t 



<P sup sup \Mr\> fTj ; ||Af || P)t < \jMvt 

\{(;e»- 1 <«,<e"}fej6if j } Mv t ' 



<P sup sup |MH>^ 7i ;||M||,, e »<VM^ 

\{t;e«-i«, t <e™} {v,zBK„} Me™/* 



< 2exp 



M 3 e( 3 / 2 )"</) 2 ((5)y' 
For all $u > 0$ and all $M > 0$ the series $\sum_n P(B_n)$ converges, and the lemma follows by Borel-Cantelli. □

Note that v 4 J 4 is enough for our needs, but actually one can replace v\^ 4 in the lemma with vf without 
changing the proof. 

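Lemma 2.3. For every $M > 0$ there is some $\tau = \tau(\omega, M)$ such that a.s. for $t > \tau(\omega, M)$ it holds

$$ \sup_{y < z \in K_\delta} \frac{1}{z - y} \int_y^z \frac{L^x_t}{V_t}\,dx\; 1_{\Omega_{M,t}} \le A $$

with some constant $A > 0$, independent of $M$, $t$, $\omega$.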
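Proof. Let $y < z$ in $K_\delta$. Then

$$ \int_y^z \frac{L^x_t}{V_t}\,dx\; 1_{\Omega_{M,t}} \le (z - y) \Big( \frac{L^{x_0}_t}{V_t} + \sup_{y,z \in K_\delta} \frac{|L^y_t - L^z_t|}{V_t}\, 1_{\Omega_{M,t}} \Big). \tag{12} $$

Using the Chacon-Ornstein theorem and Proposition 2.1, almost surely $\lim_{t\to\infty} L^{x_0}_t / V_t = \sigma^2(x_0)\mu(x_0) =: l_0$, with
$l_0$ independent of $\omega$. Using the definition (7) and the notations (10) and (8) we can write: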

iLf-Lfl 26 , |Mf z l , A t 



* lfi M ,t < 77 H 77 "Ifljif.t + 77- 



V t - Vt Vt M,< V 

Following Lemma 2.2 there is some t = t(u>,M), such that for t > t(u>,M) 
\Vt-m 28 8 A t 

where we have denoted l\ = lim^oo From (12) and (13) we deduce that the lemma holds with A = 
2(l Q + h). □ 

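Lemma 2.4. For every $M$ there is some $\tau = \tau(\omega, M)$ such that a.s. for $t > \tau(\omega, M)$ it holds, with some
constant $C > 0$,

$$ \sup_{y,z \in K_\delta} \frac{|L^y_t - L^z_t|}{V_t}\, 1_{\Omega_{M,t}} \le \delta C. $$

Proof. We apply Lemmas 2.2 and 2.3 to the decomposition

$$ \frac{|L^y_t - L^z_t|}{V_t}\, 1_{\Omega_{M,t}} \le \frac{2\delta}{V_t} + \frac{|M^{y,z}_t|}{V_t}\, 1_{\Omega_{M,t}} + C \int_y^z \frac{L^x_t}{V_t}\,dx\; 1_{\Omega_{M,t}}, $$

where in Lemma 2.2 we take $u = \delta$. □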

This property is weaker than the restricted uniform continuity of the normalized local time, but it is enough for
what we want to do. Namely, it provides the necessary "uniform" argument to show that $\inf_K L^x_t / V_t$ is not far
from $\sup_K L^x_t / V_t$ for $K = [x_0 - \delta; x_0 + \delta]$ with a small $\delta$. Now we can prove Theorem 3:

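Proof. We first prove the theorem for $K_\delta = [x_0 - \delta; x_0 + \delta]$ with some small $\delta$ depending on $x_0$.

Let $\tau = \tau(\omega, M)$ be as in Lemma 2.4. As previously, let $l_0 > 0$ be the almost sure limit $l_0 = \lim_{t\to\infty} L^{x_0}_t / V_t$. Let
$\tau'(\omega) = \inf\{t > 0;\ l_0/2 \le L^{x_0}_t / V_t \le 2 l_0\}$ and $T = \max(\tau, \tau')$. Fix $M > 0$ and $\delta = \delta(x_0) > 0$ in such a way that
the following holds with $C > 0$ from Lemma 2.4:

$$ \frac{1}{M} \le \frac{l_0}{2} - \delta C \le 2 l_0 + \delta C \le M. $$

We have

$$ \begin{aligned}
P\Big( \frac{1}{M} \le \inf_{x \in K_\delta} \frac{L^x_t}{V_t} \le \sup_{x \in K_\delta} \frac{L^x_t}{V_t} \le M \Big)
&\ge P\Big( \Big\{ \frac{1}{M} \le \frac{L^{x_0}_t}{V_t} + \inf_{x \in K_\delta} \frac{L^x_t - L^{x_0}_t}{V_t} \le \frac{L^{x_0}_t}{V_t} + \sup_{x \in K_\delta} \frac{L^x_t - L^{x_0}_t}{V_t} \le M \Big\} \cap \Omega_{M,t} \cap \{t > T\} \Big) \\
&\ge P\Big( \Big\{ \frac{1}{M} \le \frac{L^{x_0}_t}{V_t} - \delta C \le \frac{L^{x_0}_t}{V_t} + \delta C \le M \Big\} \cap \Omega_{M,t} \cap \{t > T\} \Big) \\
&\ge P\Big( \Big\{ \frac{1}{M} \le \frac{l_0}{2} - \delta C \le 2 l_0 + \delta C \le M \Big\} \cap \Omega_{M,t} \cap \{t > T\} \Big) \\
&\ge P\big( \Omega_{M,t} \cap \{t > T\} \big).
\end{aligned} \tag{14} $$

For the last expression we have

$$ \lim_{M\to\infty} \liminf_{t\to\infty} P\big( \Omega_{M,t} \cap \{t > T(\omega, M)\} \big) = 1. $$

Note that we can easily replace $V_t$ with $v_t$ in (14), so the theorem for $K_\delta$ is proven.

Now let $K$ be some compact in the state space. For each $x \in K$ we choose $\delta_x > 0$ such that the theorem
with $K(x) = [x - \delta_x; x + \delta_x]$ is true. Then by standard compactness arguments we obtain the theorem for
$K$. □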

The strong Chacon-Ornstein (SCO) theorem states that if $A_t$ is a SAF and $v_t$ is a deterministic equivalent
associated to $X_t$, then $E_\nu A_t \sim v_t$ as $t \to \infty$ for any initial distribution $\nu$. The following theorem is a uniform
version of the SCO theorem for a local time.

Theorem 4. The sequence of real functions $g_t(y) = E_\nu L^y_t / v_t$ converges uniformly on $K_\delta$ to the function $l(y) =
\sigma^2(y)\mu(y)$ as $t \to \infty$.

Proof. The pointwise convergence follows from the strong Chacon-Ornstein theorem and Proposition
2.1:

$$ \lim_{t\to\infty} \frac{E_\nu L^y_t}{v_t} = \frac{\|\nu_{L^y}\|}{1} = \sigma^2(y)\mu(y). \tag{15} $$

To show the uniform convergence we will prove that the family $\{E_\nu L^y_t / v_t\}_{t > 0}$ is equicontinuous on
$K_\delta = [x_0 - \delta, x_0 + \delta]$. By the Tanaka formula,

$$ \tfrac{1}{2} L^y_t = (X_t - y)^+ - (X_0 - y)^+ - \int_0^t 1_{\{X_s > y\}}\,dX_s, $$

hence

$$ \tfrac{1}{2}\big( E_\nu L^y_t - E_\nu L^z_t \big) = E_\nu\big( (X_t - y)^+ - (X_t - z)^+ \big) - E_\nu\big( (X_0 - y)^+ - (X_0 - z)^+ \big) - E_\nu \int_0^t 1_{\{y < X_s \le z\}} b(X_s)\,ds. $$

So, as an initial inequality we can take

$$ \tfrac{1}{2}\big| E_\nu L^y_t - E_\nu L^z_t \big| \le 2|y - z| + E_\nu \int_0^t 1_{\{y < X_s \le z\}} |b(X_s)|\,ds. \tag{16} $$

We rewrite this using the occupation formula, and divide by $v_t$:

$$ \frac{|E_\nu L^y_t - E_\nu L^z_t|}{2 v_t} \le \frac{2|y - z|}{v_t} + \int_y^z \frac{|b(x)|}{\sigma^2(x)} \frac{E_\nu L^x_t}{v_t}\,dx. \tag{17} $$

From (16) we have for all $y, z \in K_\delta$

$$ \tfrac{1}{2}\big| E_\nu L^y_t - E_\nu L^z_t \big| \le 4\delta + E_\nu \int_0^t 1_{\{X_s \in K_\delta\}} |b(X_s)|\,ds = 4\delta + \tilde v_t. \tag{18} $$

Since $b$ is locally bounded and $X$ is strong Feller, the second term on the right-hand side in (18) is an
expectation of a special additive functional, which was denoted by $\tilde v_t$. Hence, for all $x \in K_\delta$

$$ \frac{E_\nu L^x_t}{v_t} \le \frac{|E_\nu L^x_t - E_\nu L^{x_0}_t|}{v_t} + \frac{E_\nu L^{x_0}_t}{v_t} \le \frac{2(4\delta + \tilde v_t)}{v_t} + \frac{\hat v_t}{v_t}, $$

where we denoted $E_\nu L^{x_0}_t$ by $\hat v_t$ to stress the fact that $E_\nu L^{x_0}_t$ is an expectation of another special additive
functional. According to the strong Chacon-Ornstein theorem the right-hand side expression converges to
some constant, hence for some constant $M > 0$ and $t > 0$ large enough

$$ \sup_{x \in [x_0 - \delta, x_0 + \delta]} \frac{E_\nu L^x_t}{v_t} \le M. \tag{19} $$

Now we insert (19) in (17): for some $K > 0$ and $t$ large enough

$$ \frac{|E_\nu L^y_t - E_\nu L^z_t|}{v_t} \le \frac{4|y - z|}{v_t} + 2M \sup_{[x_0 - \delta, x_0 + \delta]} \frac{|b|}{\sigma^2}\, |y - z| \le K |y - z|, $$

and uniform convergence follows from the Ascoli lemma. □

Corollary 2.1. Let $h_t$, $t > 0$, be some deterministic function with range in $[0, \delta]$ and such that $\lim_{t\to\infty} h_t = 0$.
For all $y \in [-1; 1]$, $\lim_{t\to\infty} E_\nu L^{x_0 + h_t y}_t / v_t = \mu(x_0)\sigma^2(x_0)$.

Proof. We use the theorem about continuous convergence, see [13], p. 194: A sequence of continuous functions
$(f_n)$ defined on some metric compact $X$ converges uniformly to $f$ if and only if for any sequence $(x_n)$
the condition $\lim_{n\to\infty} x_n = x$ implies $\lim_{n\to\infty} f_n(x_n) = f(x)$. □

Corollary 2.2. Let $\phi$ be measurable, bounded and compactly supported in $[-1; 1]$. Let $h : \mathbb{R}_+ \to \mathbb{R}_+$ be such
that $\lim_{t\to\infty} h_t = 0$, and $\psi : \mathbb{R} \to \mathbb{R}_+$ continuous at $x_0$. Then for any initial probability $\nu$ the following limit
holds:

$$ \lim_{t\to\infty} \frac{E_\nu \int_0^t \phi((X_s - x_0)/h_t)\psi(X_s)\,ds}{h_t v_t} = \psi(x_0)\mu(x_0) \int_{\mathbb{R}} \phi(y)\,dy. $$

Proof. Using the occupation formula,

$$ \frac{E_\nu \int_0^t \phi((X_s - x_0)/h_t)\psi(X_s)\,ds}{h_t v_t} = \frac{E_\nu \int_{\mathbb{R}} \phi((x - x_0)/h_t)\,(\psi(x)/\sigma^2(x))\,L^x_t\,dx}{h_t v_t} = \int_{\mathbb{R}} \phi(y)\, \frac{\psi(x_0 + h_t y)}{\sigma^2(x_0 + h_t y)}\, f_t(y)\,dy, $$

where we have denoted $f_t(y) = E_\nu L^{x_0 + h_t y}_t / v_t$. Using Corollary 2.1,

$$ \forall y \in [-1; 1] \quad \lim_{t\to\infty} f_t(y) = \mu(x_0)\sigma^2(x_0). $$

Also, using (19), for some $M > 0$ and $t$ large enough we obtain $\sup_{y \in [-1,1]} f_t(y) \le M$. The dominated convergence
theorem concludes the proof. □

3. Limit theorems for AF and martingales composed with deterministic or random 
bandwidth 

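Let $\phi : \mathbb{R} \to \mathbb{R}_+$ be of class $C^1$, with support in $[-1, 1]$ and $\int_{\mathbb{R}} \phi(x)\,dx = 1$. Let $X$ be a Harris recurrent
diffusion subject to the assumptions of (1). Denote for $h \ge 0$

$$ \frac{1}{h} A^h_t = \frac{1}{h} \int_0^t \phi\Big(\frac{X_s - x_0}{h}\Big)\,ds $$

(for $h = 0$ see (21)), and for $h > 0$

$$ M^h_t = \int_0^t \phi\Big(\frac{X_s - x_0}{h}\Big)\sigma(X_s)\,dW_s. \tag{20} $$

Since $\phi$ is $C^1$, there is a continuous modification of $\frac{1}{h} A^h_t$ and, if $\sigma$ is bounded, of $M^h_t$, $h \in\, ]0, \delta]$. We adopt
these modifications in what follows. As before, for $\delta > 0$, $K_\delta = [x_0 - \delta, x_0 + \delta]$ and $v_t$ denotes a deterministic
equivalent of $X$.

In this section we obtain asymptotic results for $A^h_t$ and $M^h_t$ composed with a deterministic or random
bandwidth $h_t$.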

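Theorem 5. For any initial probability $\nu$,

$$ \lim_{N\to\infty} \liminf_{t\to\infty} P_\nu\Big( \frac{1}{N} \le \inf_{h \in [0;\delta]} \frac{A^h_t}{h v_t} \le \sup_{h \in [0;\delta]} \frac{A^h_t}{h v_t} \le N \Big) = 1. $$

Proof. By the occupation formula, there is a $P$-negligible set $\Omega_0$, outside of which we have

$$ \frac{1}{h} A^h_t = \int_{-1}^{1} \phi(y)\, \frac{L^{x_0 + hy}_t}{\sigma^2(x_0 + hy)}\,dy \tag{21} $$

for all $t > 0$ and every $h \in [0, \delta]$.

With some $0 < C < \infty$, outside of $\Omega_0$, we have

$$ \frac{1}{C} \inf_{x \in K_\delta} L^x_t \le \inf_{h \in [0;\delta]} \frac{1}{h} A^h_t \le \sup_{h \in [0;\delta]} \frac{1}{h} A^h_t \le C \sup_{x \in K_\delta} L^x_t $$

for all $t > 0$. Fix some $N > 0$. Then

$$ P\Big( \frac{1}{N} \le \inf_{h \in [0;\delta]} \frac{A^h_t}{h v_t} \le \sup_{h \in [0;\delta]} \frac{A^h_t}{h v_t} \le N \Big) \ge P\Big( \frac{C}{N} \le \inf_{x \in K_\delta} \frac{L^x_t}{v_t} \le \sup_{x \in K_\delta} \frac{L^x_t}{v_t} \le \frac{N}{C} \Big) $$

and the theorem follows from Theorem 3. □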

Corollary 3.1. Let $H_t(\omega)$, $t > 0$, be some measurable process with range in $[0, \delta]$. Denote by $A^{H_t}_t$ the
composition of $A^h_t(\omega)$ with $H_t(\omega)$. Then for any initial probability $\nu$,

$$ \lim_{N\to\infty} \liminf_{t\to\infty} P_\nu\Big( \frac{1}{N} \le \frac{A^{H_t}_t}{H_t v_t} \le N \Big) = 1. $$
Lemma 3.1. Let $h : \mathbb{R}_+ \to\, ]0, \delta]$ be such that $\lim_{t\to\infty} h_t = 0$. Let $(M^h_t)$ be the martingale defined by (20) and let
$M^{h_t}_t$ be the composition of this martingale with $h_t$. Then for any initial probability $\nu$,

$$ \lim_{N\to\infty} \liminf_{t\to\infty} P_\nu\Big( -N \le \frac{M^{h_t}_t}{\sqrt{h_t v_t}} \le N \Big) = 1. $$

Proof. For fixed $t$, $(M^{h_t}_s;\ s \in [0, t])$ is a martingale, so by the Markov inequality and the definition of the
quadratic variation we have

$$ P_\nu\Big( \frac{|M^{h_t}_t|}{\sqrt{h_t v_t}} > N \Big) \le \frac{E_\nu (M^{h_t}_t)^2}{N^2 h_t v_t} = \frac{E_\nu \int_0^t \phi^2((X_s - x_0)/h_t)\sigma^2(X_s)\,ds}{N^2 h_t v_t}. $$

Lemma 3.1 follows from Corollary 2.2 and the fact that $\sigma > 0$.

□



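Lemma 3.2. Let $\nu$ be some probability on $\mathbb{R}$, $(H_t)_{t \ge 0}$ be an adapted random process with range in $]0, \delta]$ and
$h_t$ some measurable deterministic function with range in $]0, \delta]$, such that

$$ \lim_{N\to\infty} \liminf_{t\to\infty} P_\nu\Big( \frac{1}{N} \le \frac{H_t}{h_t} \le N \Big) = 1. $$

Then we have

$$ \lim_{N\to\infty} \limsup_{t\to\infty} P_\nu\Big( \Big| \frac{M^{H_t}_t}{\sqrt{h_t v_t}} - \frac{M^{h_t}_t}{\sqrt{h_t v_t}} \Big| > N \Big) = 0. $$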



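Proof. Under the assumptions of the lemma we can differentiate the stochastic integral $M^h_t$ with respect to $h$.
For $h \in\, ]0, \delta]$ its derivative is given by:

$$ \frac{d}{dh} M^h_t = \frac{1}{h} \int_0^t \psi\Big(\frac{X_s - x_0}{h}\Big)\sigma(X_s)\,dW_s = \frac{1}{h} N^h_t, $$

where we denote $\psi(x) = -x\phi'(x)$ and $N^h_t = \int_0^t \psi((X_s - x_0)/h)\sigma(X_s)\,dW_s$. Remark that $N^h_t$ and
$M^h_t$ have the same structure in the sense that $\psi(x)$ is also supported in $[-1, 1]$ and of class $C^1(\mathbb{R})$. We will
use the representation

$$ M^{H_t}_t - M^{h_t}_t = \int_{h_t}^{H_t} \frac{1}{h} N^h_t\,dh. $$

We have

$$ \begin{aligned}
P_\nu\Big( \Big| \frac{M^{H_t}_t}{\sqrt{h_t v_t}} - \frac{M^{h_t}_t}{\sqrt{h_t v_t}} \Big| > N,\ \frac{1}{\sqrt{N}} \le \frac{H_t}{h_t} \le \sqrt{N} \Big)
&\le P_\nu\Big( \frac{1}{\sqrt{h_t v_t}} \int_{h_t/\sqrt{N}}^{h_t \sqrt{N}} \frac{1}{h} |N^h_t|\,dh > N \Big) \\
&\le \frac{1}{N} \frac{1}{\sqrt{h_t v_t}} \int_{h_t/\sqrt{N}}^{h_t \sqrt{N}} \frac{1}{h} E_\nu |N^h_t|\,dh \\
&= \frac{1}{N} \frac{1}{\sqrt{h_t v_t}} \int_{h_t/\sqrt{N}}^{h_t \sqrt{N}} \frac{1}{\sqrt{h}} \frac{E_\nu |N^h_t|}{\sqrt{h}}\,dh.
\end{aligned} \tag{22} $$

Using the Cauchy-Schwarz inequality and the quadratic variation and occupation formulas,

$$ \frac{E_\nu |N^h_t|}{\sqrt{h}} \le \sqrt{\frac{E_\nu (N^h_t)^2}{h}} = \sqrt{\frac{E_\nu \int_0^t \psi^2((X_s - x_0)/h)\sigma^2(X_s)\,ds}{h}} = \sqrt{\int_{\mathbb{R}} \psi^2(y)\, E_\nu L^{x_0 + hy}_t\,dy}. $$

According to Theorem 4 the sequence of real functions $g_t(y) = E_\nu L^y_t / v_t$ converges uniformly on $K_\delta$ to the
function $l(y) = \mu(y)\sigma^2(y)$ as $t \to \infty$. By assumption $l$ is bounded on $K_\delta$, so there is some $A > 0$ and some
$t_0$ such that for all $t > t_0$ and all $h \in\, ]0, \delta]$

$$ \int_{\mathbb{R}} \psi^2(y)\, \frac{E_\nu L^{x_0 + hy}_t}{v_t}\,dy \le A. $$

We now insert this estimate in (22): for $t > t_0$,

$$ P_\nu\Big( \Big| \frac{M^{H_t}_t}{\sqrt{h_t v_t}} - \frac{M^{h_t}_t}{\sqrt{h_t v_t}} \Big| > N,\ \frac{1}{\sqrt{N}} \le \frac{H_t}{h_t} \le \sqrt{N} \Big) \le \frac{1}{N} \frac{1}{\sqrt{h_t}} \int_{h_t/\sqrt{N}}^{h_t \sqrt{N}} \frac{1}{\sqrt{h}} \sqrt{\int_{\mathbb{R}} \psi^2(y)\, \frac{E_\nu L^{x_0 + hy}_t}{v_t}\,dy}\;dh \le \frac{\sqrt{A}}{N \sqrt{h_t}} \int_{h_t/\sqrt{N}}^{h_t \sqrt{N}} \frac{dh}{\sqrt{h}} \le \frac{2\sqrt{A}}{N^{3/4}}, $$

and the lemma follows by taking the limit as $N \to \infty$, since by assumption the probability that $H_t/h_t$ lies outside
$[1/\sqrt{N}, \sqrt{N}]$ vanishes in the same limit. □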

From Lemmas 3.1 and 3.2 we deduce Corollary 3.2.

Corollary 3.2. Let $(H_t)_{t \ge 0}$ be an adapted random process with range in $]0, \delta]$. For any initial probability $\nu$,

$$ \lim_{N\to\infty} \liminf_{t\to\infty} P_\nu\Big( -N \le \frac{M^{H_t}_t}{\sqrt{h_t v_t}} \le N \Big) = 1. $$

4. Application to non-parametric estimation 

We observe until time $t$ a sample path of the process $X$, weak solution of (1). Suppose that the drift
coefficient $b$ is unknown and for $x_0 \in \mathbb{R}$ we want to estimate $b(x_0)$. Let $\delta > 0$ be such that $[x_0 - \delta; x_0 + \delta] \subset \mathbb{R}$.
We suppose that $\sigma$ and $b$ satisfy the assumptions of (1), and also that for some $\alpha \in\, ]0, 1]$ and $\gamma > 0$ the
function $b$ satisfies the local Hölder condition:

$$ \sup_{x \in [x_0 - \delta, x_0 + \delta]} \frac{|b(x) - b(x_0)|}{|x - x_0|^{\alpha}} \le \gamma. $$

Let $\phi : \mathbb{R} \to \mathbb{R}_+$ be of class $C^1$ with support in $[-1, 1]$ and $\int \phi(x)\,dx = 1$. For $h > 0$ we consider a family
of Nadaraya-Watson estimators

$$ \hat b^h_{x_0,t} = \frac{\int_0^t \phi((X_s - x_0)/h)\,dX_s}{\int_0^t \phi((X_s - x_0)/h)\,ds}. $$

Let $V_t$ be some given IAF of $X$ such that $\|\nu_V\| > 0$, and let $v_t$ be its deterministic equivalent.
Denote

$$ H_t = V_t^{-1/(2\alpha+1)} \wedge \delta; \quad h_t = v_t^{-1/(2\alpha+1)} \wedge \delta; \quad r_t = v_t^{\alpha/(2\alpha+1)}; \quad R_t = V_t^{\alpha/(2\alpha+1)}. \tag{23} $$

To estimate $b(x_0)$ we use $\hat b^{H_t}_{x_0,t}$, so $H_t$ and $R_t$ play the role of "random bandwidth" and of "random rate,"
respectively.
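
A data-driven bandwidth of this kind can be computed from a discretely sampled path. The sketch below rests on assumptions that are not part of the theorem: the observable IAF $V_t$ is taken to be the occupation time of a window of radius `radius` around $x_0$ (one admissible choice among many), the time integral is replaced by a Riemann sum over the sampling grid, and the function names are hypothetical.

```python
import math

def occupation_time(path, dt, x_point, radius=1.0):
    """Riemann-sum approximation of V_t = int_0^t 1{|X_s - x_point| <= radius} ds."""
    return dt * sum(1 for x in path[:-1] if abs(x - x_point) <= radius)

def random_bandwidth_and_rate(path, dt, x_point, alpha, delta, radius=1.0):
    """Returns (H_t, R_t) with H_t = min(V_t^(-1/(2 alpha + 1)), delta)
    and R_t = V_t^(alpha/(2 alpha + 1)), computed from the observed path only."""
    v_obs = occupation_time(path, dt, x_point, radius)
    if v_obs <= 0.0:
        raise ValueError("the path has not visited the window around x_point yet")
    exponent = 1.0 / (2.0 * alpha + 1.0)
    h_obs = min(v_obs ** (-exponent), delta)
    r_obs = v_obs ** (alpha * exponent)
    return h_obs, r_obs

# Toy usage on a synthetic path that stays at x_point for 10 time units.
toy_path = [0.0] * 101
h_obs, r_obs = random_bandwidth_and_rate(toy_path, 0.1, 0.0, alpha=1.0, delta=0.3)
```

Because both quantities depend on the path only through the observed occupation time, no knowledge of the semigroup of $X$ is needed to compute them.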
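Theorem 6. $R_t$ is an upper rate of convergence of $\hat b^{H_t}_{x_0,t}$ to $b(x_0)$:

$$ \lim_{K\to\infty} \limsup_{t\to\infty} P_\nu\big( R_t\, |\hat b^{H_t}_{x_0,t} - b(x_0)| > K \big) = 0. $$

Proof. Using (1) we can write:

$$ \begin{aligned}
|\hat b^{H_t}_{x_0,t} - b(x_0)|
&\le \frac{\big| \int_0^t \phi((X_s - x_0)/H_t)\sigma(X_s)\,dW_s \big|}{\int_0^t \phi((X_s - x_0)/H_t)\,ds} + \frac{\int_0^t \phi((X_s - x_0)/H_t)\,|b(X_s) - b(x_0)|\,ds}{\int_0^t \phi((X_s - x_0)/H_t)\,ds} \\
&\le \frac{\big| \int_0^t \phi((X_s - x_0)/H_t)\sigma(X_s)\,dW_s \big|}{\int_0^t \phi((X_s - x_0)/H_t)\,ds} + \gamma H_t^{\alpha} \\
&= \frac{\big| \int_0^t \phi((X_s - x_0)/H_t)\sigma(X_s)\,dW_s \big|}{\sqrt{H_t v_t}} \times \frac{H_t v_t}{\int_0^t \phi((X_s - x_0)/H_t)\,ds} \times \frac{1}{\sqrt{H_t v_t}} + \gamma H_t^{\alpha},
\end{aligned} $$

where in the second line we have used the Hölder property of $b$. Let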
Q i = l \J^((X s -xo)/H t )g(X s )dW s \ < ^ 1/4 



if 1 /* - wjfl; 



fi 3 J 1 <^< K l/4 

' A' V4 - Wt - 



and 



fit k = fi* is n /2? n j7? 



We have 

P v (R t x |6^, t - 6(ar )| > K) < P„ (t,^ 2 "^ (V /2 ^= +7^ 

= P V (A' 1 / 2 + 1 > K) + Pu(Of K ), 



where we used the fact that $h_t^{\alpha} = v_t^{-\alpha/(2\alpha+1)}$ for large $t$, and $\sqrt{v_t h_t} = \sqrt{v_t^{1 - 1/(2\alpha+1)}} = v_t^{\alpha/(2\alpha+1)}$. Using Corollaries
3.1 and 3.2 we obtain

$$ \lim_{K\to\infty} \limsup_{t\to\infty} P\big( \Omega^c_{t,K} \big) = 0 $$

and Theorem 6 follows. □

In the previous theorem we express the bandwidth and the rate as functions of the additive functional $V_t$ of
the observed process $X_t$. It is easy to see that the theorem still holds if we replace $H_t$ with $h_t$ and $R_t$ with $r_t$ given
by (23), which are both deterministic.